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ACCESS METHOD 
AND 

RECORDING MEDIUM HAVING ACCESS PROCESSING 

PROGRAM 

Field of Invention 

The present invention relates to an access method and a 
recording medium having an access-processing program therein, 
particularly to a user-level I/O access method and an access- 
processing program in a virtualized multi-context environment. 

Description of t he Related Art 

Fig. 6 shows an explanatory view of a conventional system 1. 
This figure shows an explanatory view of a system where control 
registers of I/O devices are mapped into the conventional kernel 
address space. 

As shown in the figure, in the conventional system 1, control 
registers of an I/O device are set to be accessible only by a kernel 
mode under a general-purpose operating system (OS), thus 
preventing an illegal operation by a user application. A physical 
address space is divided into a kernel address space and a user 
address space. The I/O device can be accessed only by entry 
addresses in the kernel space, which correspond to the I/O device. 
And, when a process A as a user application wants to use the I/O 
device, the process A issues an operation request through a system 
call of the OS. Then, the OS checks validity of the operation 
request in the kernel mode, and thereafter, executes a concrete 
operation to the control registers in the I/O device. 

Moreover, recently, a system has been gradually used, in 
which the control registers of the I/O device are mapped into an 
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address space of a user application, and the user application (user 
process) directly accesses the control registers and reduces an 
overhead of the I/O operations. Fig. 7 shows an explanatory view 
of a conventional system 2. This figure shows a constitution 
5 example of the system where control registers are mapped into the 
user address space. In this conventional system 2, the process A 
maps a physical page for the I/O device into the user address space 
of the process A. From the page for I/O device, the control 
registers of the I/O device can be used. 
10 Next, Fig. 8 shows an explanatory view of a conventional 

% system 3. This figure shows an example of a system where the 

operation request is transferred from the user address space by 
Ci direct memory access (DMA). In order to eliminate or reduce the 

2{ overhead of the system call, it is necessary that control information 

K 15 concerning the I/O device is stored into the pages in the user 
O address space. In this point of view, the conventional system 3 is 

%l the same as the conventional system 2. Note that, if the control 

registers of the I/O device are merely mapped into the user address 
space, there occurs confusion when the control registers are 
20 accessed from the plurality of user applications. For solving this, 
contents of requests for the I/O device can be stored on a memory, in 
order that the I/O device reads the contents of requests and set them 
on the control registers. Specifically, the I/O device updates the 
contents of the control registers by DMA. And if the I/O device 
25 operates so as not to start processing a next content of request 
before the processing for the preceding request is finished (or 
reached to a breakpoint), the confusion of the operations does not 
occur. Moreover, since a memory resource is protected under the 
general-purpose OS, there is no risk of overwriting the contents of 
30 requests by other applications. 

What becomes important in the conventional system 3 is how 
to indicate the location of the operation requests on the memory to 
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the I/O device. In the conventional system 3, a context register 
storing a pointer indicating the location of the requests on the 
memory is provided in the I/O device. The OS grasps which user's 
process is running. And, the OS can rewrite the context register 
5 according to the running process. In the case where a certain 
application (for example, process A) is allocated to the processor, a 
storage location for the operation requests from the concerned 
application is set in the context register. 

In the conventional system 3, a pending register which 
10 notifies the registration of the operation requests is mapped into 
CI the user address space according to each application. The pending 

B! register is used as a flag indicating that a process requires 

ji; operations by the I/O device. In pages for user processes, the 

L r i operation requests are stored. The contents of the requests are, 

Li = 

m 15 for example, an I/O command. If the I/O device is a printer, the I/O 
JL command is a print command, a page-feed command or the like. If 

Lfl the I/O device is the other I/O device than the printer, the I/O 

command is a command as to which device input/output are 
p performed to. When the pending register is accessed, the I/O 

20 device fetches (using DMA) the operation requests from the 

memory location indicated by the context register (the pointer), and 

executes the requests. 

Summary of the Invention 

25 

However, in the conventional system 1, there has been a 
problem that the overhead of I/O operation is very large since a 
series of the kernel programs including the system call must be 
called out for each time when the user application tries to perform 
30 an I/O operation. 

Therefore, the conventional system 2 has been proposed. 
However, in this conventional system 2, multiple applications 



cannot use the I/O device simultaneously. In the example of Fig. 7, 
when the process A is using the I/O device, the process B cannot use 
the I/O device. Moreover, since one application changes the 
contents of the control register of the I/O device solely at its 
discretion, consistency in the operation of the control register 
collapses if multiple applications operate the control register on 
time-sharing basis. Accordingly, in the conventional system 2, an 
I/O resource must be allocated to one application exclusively for 
use. 

Therefore, the conventional system 3 has been proposed as a 
system for solving the problem of the overhead of the system call in 
the conventional system 1 and the problem of the multiple 
allocation of resources in the conventional system 2. However, in 
the conventional system 3, the context register in the I/O device 
becomes a part of a processor context, and for each time when the 
OS performs a context switch, a location pointer for the operation 
requests of an application allocated to the processor must be 
updated. Since the context switch is one of the fundamental 
operations of the OS, it is difficult to execute modification of 
extending the processor context to the existing OS. Still less, the 
processor context cannot be extended by adding a device driver. 
Moreover, in the case where a computer system connecting with an I/O 
device is of a multi-processor configuration, the context registers are 
required in the number equal to that of the processors, and any access 
to the pending register needs to be discernible as to which processor it 
is from. In the case where a control circuit of the I/O device is 
integrated into an LSI, in consideration of multi-processor 
correspondence, many context registers must be prepared in 
advance. This results in a waste of cost in a typical single 
processor or a parallel system with a small number of processors. 

As described above, the user-level access methods to I/O 
devices, which have been heretofore implemented, have the 
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problems to be solved, respectively. Moreover, for example, there 
has been performed the method of mapping control registers for I/O 
operations into a user memory space in order to speed up the access 
to I/O devices at the user level as described above. However, 
5 according to the conventional systems, the multiple applications 
cannot use the devices simultaneously. 

In consideration of the foregoing points, the present 
invention provides an access method enabling multiple applications 
to execute simultaneously I/O (or I/O interface) operations at a low 
10 cost by exploiting a memory management unit, and provides a 
recording medium for recording an access-processing program. 

One of the features of the present invention is that a memory 
alias (a state where multiple physical addresses are allocated to a 
single object) is generated intentionally, and that an accessed 
15 application is identified by a mechanism detecting by which alias 
address it has been accessed. Each alias of the same object is 
aligned to a page-boundary. Since physical memory of the host 
system is managed by a page unit, allocating a different alias for 
each application inhibits an illegal action such as an access to 
20 another alias address. 

According to solutions of the present invention, provided is 
an access method and a recording medium having an access- 
processing program comprising the steps of: 

requesting, by a user process, an open procedure of an I/O 
2 5 device (or I/O interface) to the operating system; 

allocating, by the operating system, a context identifier for 
indicating a request storing area of the process, and further 
mapping a memory page corresponding to the context identifier as 
an accessing address to a pending register indicating that there is 
30 an unprocessed request; 

writing, by the process, contents of operation requests to the 
I/O device into the request storing area; 
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notifying, by the process, the I/O device or the I/O interface 
that there is an unprocessed request by use of the accessing 
address for the pending register; and 

reading, by the I/O device, the operation requests of the 
5 process which are stored at the location related to the context 
identifier in the pending register. 

Furthermore, there is provided a recording medium for 
recording an access-processing program. 

In addition, there is provided an address decoding method 
10 for selecting a physically same object device in a computer system, 
^ the method comprising the steps of: 

01 selecting an object device using only a part of a physical 

I 5 = 
~? 

P address for decoding; 

~: using another part of the physical address as an identifier 

HI 15 for distinguishing multiple physical addresses selecting the object 
p device; and 

%l using the identifier in order to identify an application 

-ij process having accessed the object device. 

l7 Furthermore, there is provided a computer system having an 

20 address decoder selecting a physically same object device, 
comprising: 

means for selecting an object device by use of only a part of a 
physical address for decoding the address; 

means for identifying an identifier by use of another part of 
2 5 the physical address as the identifier for distinguishing multiple 
physical addresses selecting the object device; and 

means for identifying an application process having accessed 
the object device by use of the identifier. 

30 Brief Des cription of the Drawings 



Fig. 1 is a constitutional view of a system relating to the 
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present invention. 

Fig. 2 is an explanatory view of an access method according 
to the present invention. 

Fig. 3 is a flowchart regarding an OS and a process in the 
5 access method according to the present invention. 

Fig. 4 is a flowchart regarding an I/O device in the access 
method according to the present invention. 

Fig. 5 is an explanatory view of a mechanism storing data in 
a pending register. 
10 Fig. 6 is an explanatory view of a conventional system 1. 

Fig. 7 is an explanatory view of a conventional system 2. 

Fig. 8 is an explanatory view of a conventional system 3. 

Description of the Preferred Embodiments 

15 

The present invention provides a novel access method to an 
I/O device at a user level, which solves the foregoing problems. 

In the conventional system 3, if the I/O device can identify 
which application has executed an access to the pending register 

20 indicating that the operation requests not subjected to the I/O 
processing are present on the memory, a low-cost method of 
obtaining the memory location for the operation requests can be 
constituted. In an extreme instance, if sufficient pairs of the 
context registers and the pending registers exist, and if each 

25 application space maps one pending register, then an address for 
reading out the requests can be decided by use of the contents of 
the context registers corresponding to the accessed pending 
registers. In this case, the context registers need not to be 
provided in the I/O device, and areas on the embedded memory in 

30 the I/O device can be sufficiently substituted therefor. Positions of 
the context registers on the memory can be computed from the 
positions of the accessed pending registers. If the context register 
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can be provided as an area on the embedded memory inside the I/O 
device, and if the number of pending registers can be restricted into 
one or a small number, a user-level access method to an I/O device 
with very low cost and very little overhead or non-overhead can be 
5 achieved. 

In the present invention, only one substance of the pending 
register is prepared. However, multiple address aliases of the 
pending register are prepared, and a mechanism is prepared for 
determining from which alias the access is made to the pending 
10 register, thus the application (processor context) that has been 
accessing the pending register is identified. Hereinbelow, 
description will be made for an embodiment of the present 
invention by showing a concrete example. 

Fig. 1 shows an example of a constitutional view of the 
15 system relating to the present invention. 

Fig. 1 shows a system for realizing an access method to the 
I/O device with a low overhead at the user level by utilizing, for 
example, a workstation or personal computer with a standard 
configuration. 

2 0 This system comprises: I/O devices (A) 1-1 and (B) 1-2; a 

processor 2; a cache 3; a system LSI 4; a main memory 5; and an I/O 
bus 6. The I/O devices (A) 1-1 and (B) 1-2 comprise the pending 
registers indicating that unprocessed requests are registered 
therein. These pending registers are mapped into a memory space 

2 5 (or subjected to memory mapping), and are accessed from the 
processor (to be described later in detail). The processor 2 
executes various processing according to the OS in cooperation with 
the cache 3. The cache 3 is a high-speed memory for copying a 
program and data for the purpose of speeding up the processing. 

30 The system LSI 4 controls transmission/receipt of the data 
interactively among the I/O devices (A) 1-1 and (B) 1-2, the 
processor 2 and the main memory 5. The main memory 5 stores 
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the program, the data and the like, such as the operating system 
(OS), various processes as applications and the physical address 
space. And the I/O bus 6 can be configured as an extension bus. 

Next, Fig. 2 shows an explanatory view of the access method 
5 according to the present invention. This figure shows an address 
map of pending registers. 

In this example, processes A to C 210 to 212, a physical 
address space 22 and an I/O device 1 are shown. The I/O device 1 
comprises: a pending register 23; a DMA engine 24; a request 
10 executing register 25; and an embedded memory 26. Moreover, 
pages for process A to C 270 to 272 and memory pages for process A 
to C 280 to 282 are prepared in the physical address space, and the 
requests from the respective processes are stored in request storing 
areas 290 to 292. 

15 Fig. 3 shows a flowchart regarding the OS and the process in 

the access method according to the present invention. And, Fig. 4 
shows a flowchart regarding the I/O device in the access method 
according to the present invention. Hereinbelow, description will 
be made for the access method of the present invention with 

20 reference to Figs. 2 to 4. 

First, the processing regarding the OS and the process will 
be described based on Fig. 2 and the flowchart of Fig. 3. 

Since the OS manages the process, the OS has grasped the 
presence of the processes A to C from the beginning when they were 

2 5 created. What is understood from an open procedure is that the 
process that requested the open procedure to the OS is going to use 
the I/O device 1 later. First, the process for using the I/O device, 
from the viewpoint of process, will be executed as follows. 

In Step S101, a process (for example, process A 210) requests 

30 the open procedure of the I/O device 1 to the OS. At this time, the 
process instructs the OS on an area where its own requests to the 
I/O device 1 will be stored. Next, in Step S102, first during the 
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open procedure, the OS allocates an unused context ID for the I/O 
device 1 (for example, ID = 0) to the process A 210, and maps the 
physical memory page 280 (page 0) corresponding to this ID as an 
logical address for access to the pending register 23 by the process 
5 A 210. The mapped logical address is notified from the OS to the 
process A 210 as a result of the open procedure (system call). 
Moreover, in Step S102, secondly during the open procedure, the OS 
stores a pointer (physical address) to the request storing area 290 
of the process A 210 in the embedded memory 26 in the I/O device 1. 

10 The stored address for requests is set in a location easily 
computable with the context ID allocated to the process A 210. 
That is, a table for retrieving the pointer to the request storing 
area out of the context ID is formed on the embedded memory. 

In Step S103, the process A 210 describes the contents of 

15 operation requests to the I/O device 1 into its own request storing 
area 290. In Step S104, after description of the contents of 
requests, the process A 2 10 notifies the I/O device of presence of the 
unprocessed requests by use of the logical address for the pending 
register 23, which is allocated by the open procedure. To be 

20 concrete, the process A 210 accesses the address of the pending 
register 23. In Step S105, the I/O device reads the request out of 
the request storing area 290. In Step S106, if the process A 210 
has another request to the I/O device 1, Step 103 and Step 104 are 
iterated. In Step S107, when the process A requests a close 

25 procedure of the I/O device to the OS, or when the process A 210 is 
ended, the OS withdraws the physical page for the pending register 
23 allocated to the process A 210, withdraws the context ID for the 
I/O device (to be stored as a blank ID), and clears the entry of the 
pointer to the storing area 290 of the request from process A 2 10 on 

30 the embedded memory 26. 

Next, based on Fig. 2 and the flowchart of Fig. 4, the 
processing regarding the I/O device will be described. From the 
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viewpoint of the I/O device, the processing will be executed as 
follows. 

In Step S201, it is identified from output of a first-in first- 
out (FIFO) memory output that the pending register 23 has been 
5 accessed. In Step S202, since the FIFO output includes the 
context ID, the physical address of the request storing area 290 of 
the process A 210 that has accessed the pending register is 
identified by referring to the table on the embedded memory 26. 
In Step S203, the DMA engine 24 reads out the contents of the 

10 request storing area 290, controls the internal register 25 
according to the contents, and executes the request. In Step S204, 
the I/O device iterates these operations until, for example, the 
power source is cut off. 

The process B and the process C also can use the I/O device 1 

15 in exactly the same manner. Here, it is important from a 
viewpoint of reduction of the overhead that the OS is never 
interposed in Steps S103 and S104. Even if the processes allocated 
to the host processor are switched on time-sharing basis, the 
requesting process is uniquely understood by use of the physical 

20 address to the pending register that includes the context ID. 
Accordingly, the OS needs not to perform any additional operation 
in the event of switching the processes. Note that, in the 
conventional system 3, the context registers must have been 
switched. 

25 Next, Fig. 5 shows an explanatory view of a mechanism 

storing data in the pending register. 

A memory structure on the top of Fig. 5 shows a physical 
address 51 which specifies the I/O device. 

Here, as an example, the physical address 51 is constituted 
30 of 36 bits and includes: a field for function select as page offset in 0 
to 11 th bits; a field for the context ID for process in 12 th to 19 th bits; 
and a field for function select in 20 th to 35 th bits. Moreover, as in 
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the figure, a field for indicating the context ID (e.g., 8 bits in the 
figure) is prepared in a portion higher in order than the page offset 
(e.g., 12 bits or 13 bits in a typical processor). It should be noted 
that the field for the context ID is not used for decoding the address, 
5 and that what is usable for decoding the address of the pending 
register is portions of the field higher in order than the context ID 
and the page offset. The area for the function select indicates a 
location of the pending register. 

The function select in the high-order address must be 

10 decoded (decided uniquely), while the low-order address (offset 
portion) does not have to be decoded. In the case where the offset 
portion is not to be decoded, the pending register is specified only 
by the high-order address, regardless of what kind of pattern the 
offset portion is. Note that, since the memory address can be 

15 allocated to the process only by page unit, other processes are not 
affected if the decoding of the offset portion is omitted. On the 
contrary, when the decoding is not omitted, another register or a 
memory can be allocated on the same page of the pending register. 
In this case, an address of a different bit pattern in the page offset 

20 portion is used. 

Moreover, the context ID field indicates the process using 
the I/O device (for example, any one of the processes A to C). When 
the context ID field has 8 bits, even if the 256 processes issue 
requests for using the I/O device (to open) simultaneously, the 

2 5 context ID field can cope with all requests. 

In the case where a fixed address indicating the location of 
the pending register is stored in the function select field of the 
physical address 51, an address decoder 52 for the pending register 
selects the pending register, and stores the context ID of the 

30 physical address 51 in the pending register 53. Moreover, 
according to needs, data 54 can be stored in the pending register 53 
in correspondence with the context ID. The data 54 can be given 




from, for example, the processor, the main memory, an interface 
and the like (for example, in the event of store/swap access). An 
output 55 of the pending register 53 includes the context ID, and 
occasionally, also includes the data corresponding thereto. 
5 Each application has a specified context ID allocated thereto, 

and maps only the page according to the context ID. Specifically, 
where the application is different, the same pending register is 
accessed by a physical address having a different context ID field. 
Where the I/O device has a capability of holding and interpreting 

10 the context ID field each time when it is accessed, the I/O device 
can clarify as to which application has accessed the pending 
register. The access to the pending register is an access by the 
user application, and there is no limitation to the timing of 
execution. The I/O processing cannot be performed without 

15 synchronizing the operation of the I/O device and the receipt of a 
new request by the pending register. Therefore, the access to the 
pending register 53 must be buffered to the FIFO configuring the 
same. At this time, the contents of access (information telling that 
they are "writing" and "written data" when the contents are 

20 writing) are not only buffered, but the context ID is also buffered 
simultaneously. The address of the area, where the operation 
requests are stored by the application, and the address are 
obtained from the context ID by an appropriate calculation, and the 
I/O device reads out the requests using DMA engine 24, and 

2 5 performs the input/output processing. 

A return-value or interruption can be used to deal with 
overfilling of the FIFO buffer. 

• In the case where the return-value is used to deal with the 
overfilling, if the "written data" is not required, the pending 
30 register is accessed by a load (memory reading-out) instruction. 
And in the case where the FIFO buffer is overfilled, an error code 
(for example, -1) is returned as the return-value. The user 
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application checks the return-value and iterates the access until it 
succeeds. If the "written data" is required, the user application 
delivers the data to the I/O device 1 while receiving the return- 
value by a swap (atomic read-write operation) instruction. 
5 • It takes a long time and a much cost for the processor to execute 
the load instruction or the swap instruction through the extension 
bus for the I/O device in comparison with the store instruction. 
Therefore, in the case where the interruption is used to deal with 
the overfilling of the FIFO buffer, the overfilling of the FIFO buffer 

10 is detected by the interruption in order to enable the access to the 
pending register by the store instruction. If the capacity of the 
FIFO buffer is sufficient and the capability of the I/O device 1 is not 
saturated, then the overfilling of the FIFO buffer does not occur. 
If the overfilling of the FIFO buffer occurs, then, thinking that it 

15 costs inevitably a certain amount of the processing cost, the 
interruption is generated to temporarily store the application 
which have failed to access and the contents of access in the 
pending register by the OS. Thereafter, when the application and 
the contents of access are registerable in the I/O device, they are 

20 processed. 

Note that, the access method according to the present 
invention can be applied to an appropriate access to an interface, 
not being limited to the I/O device. Moreover, the access method 
according to the present invention can be provided as an access- 
25 processing program by a recording medium such as CD-ROM or a 
transmission medium such as the Internet. 

Indus tri al Ap plicab i lity 



30 As described above, according to the present invention, there 

can be provided the access method enabling multiple applications 
to execute simultaneously the I/O operations (or the interface 
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operations) at a low cost by exploiting the memory management 
unit, and the recording medium for recording the access-processing 
program. 



